Simplification of Intermediate Results during Intersection of Multiple Weighted Automata

Author

  • Anssi Yli-Jyrä
Abstract

This paper presents a new simplification method for weighted finite automata. The method suggests efficient approaches to some problems related to the intersection of automata. Minimal automata can be made smaller through simplifications such as merging of states. Although such simplifications change the recognized language, some other essential properties of the automaton may be preserved. For example, if we want to count, in a deterministic automaton, the strings whose length is below some limit, we can first merge some states and then count the strings using the simplified result, which may be smaller than the minimal automaton [3]. The idea of simplifications that preserve certain essential properties also lends itself to the case where the strings are described conjunctively, by an intersection of multiple automata. State merging when computing properties of an intersection can reduce the overall time complexity, and it resembles the projection operation on relation tables that is used in query optimization in modern database systems [5]. Our method assumes that an intersection of multiple minimal automata is carried out through pairwise intersections of automata. Some properties of the automata to be intersected indicate redundancy, which leads to simple optimizations that are possible especially in unweighted automata, but also in some classes of weighted automata:
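The two baseline operations the abstract builds on, pairwise intersection and counting strings up to a length limit in a deterministic automaton, can be sketched as follows. This is a minimal illustration for unweighted automata, not the paper's simplification method: it performs no state merging, and the DFA encoding (a `(start, finals, delta)` tuple), the function names `intersect` and `count_up_to`, and the example automata `EVEN_A` and `ENDS_B` are all assumptions made for this sketch.

```python
from collections import defaultdict

# Hypothetical DFA encoding for this sketch: (start, finals, delta),
# where delta maps (state, symbol) -> state. A missing entry rejects.

def intersect(a, b):
    """Product construction, restricted to reachable state pairs."""
    (start_a, finals_a, delta_a), (start_b, finals_b, delta_b) = a, b
    out_a, out_b = defaultdict(dict), defaultdict(dict)
    for (q, x), r in delta_a.items():
        out_a[q][x] = r
    for (q, x), r in delta_b.items():
        out_b[q][x] = r
    start = (start_a, start_b)
    delta, finals = {}, set()
    stack, seen = [start], {start}
    while stack:
        p, q = stack.pop()
        if p in finals_a and q in finals_b:
            finals.add((p, q))
        for x, p2 in out_a[p].items():
            q2 = out_b[q].get(x)
            if q2 is None:
                continue  # b has no move on x, so the pair has none either
            delta[((p, q), x)] = (p2, q2)
            if (p2, q2) not in seen:
                seen.add((p2, q2))
                stack.append((p2, q2))
    return start, finals, delta

def count_up_to(dfa, max_len):
    """Count accepted strings of length <= max_len (inclusive)."""
    start, finals, delta = dfa
    # paths[q] = number of distinct strings of the current length reaching q;
    # determinism guarantees each string follows exactly one path.
    paths = {start: 1}
    total = 1 if start in finals else 0
    for _ in range(max_len):
        nxt = defaultdict(int)
        for (q, _x), r in delta.items():
            if q in paths:
                nxt[r] += paths[q]
        paths = nxt
        total += sum(n for q, n in paths.items() if q in finals)
    return total

# Example automata over {a, b} (illustrative only):
# EVEN_A accepts strings with an even number of a's,
# ENDS_B accepts strings whose last symbol is b.
EVEN_A = (0, {0}, {(0, "a"): 1, (0, "b"): 0, (1, "a"): 0, (1, "b"): 1})
ENDS_B = (0, {1}, {(0, "a"): 0, (0, "b"): 1, (1, "a"): 0, (1, "b"): 1})

both = intersect(EVEN_A, ENDS_B)
print(count_up_to(both, 3))
```

The counting pass costs time proportional to the number of transitions times the length limit, which is why a smaller automaton that preserves the count, as the abstract proposes, would make this step cheaper. An intersection of more than two automata can be obtained by folding `intersect` over the list pairwise.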

Similar resources

NLP Applications Based on Weighted Multi-Tape Automata

This article describes two practical applications of weighted multi-tape automata (WMTAs) in Natural Language Processing, that demonstrate the augmented descriptive power of WMTAs compared to weighted 1-tape and 2-tape automata. The two examples concern the preservation of intermediate results in transduction cascades and the search for similar words in two languages. As a basis for these appli...

Algorithms for Weighted Multi-Tape Automata (XRCE Research Report 2004/031, Andre Kempe; arXiv:cs.CL/0406003v1, 2 Jun 2004)

This report defines various operations and describes algorithms for weighted multi-tape automata (WMTAs). It presents, among others, a new approach to multi-tape intersection, meaning the intersection of a number of tapes of one WMTA with the same number of tapes of another WMTA, which can be seen as a generalization of transducer intersection. In our approach, multi-tape intersection is not co...

Algorithms for weighted multi-tape automata

This report defines various operations and describes algorithms for weighted multi-tape automata (WMTAs). It presents, among others, a new approach to multi-tape intersection, meaning the intersection of a number of tapes of one WMTA with the same number of tapes of another WMTA, which can be seen as a generalization of transducer intersection. In our approach, multi-tape intersection is not co...

Efficient Computation of the Relative Entropy of Probabilistic Automata

The problem of the efficient computation of the relative entropy of two distributions represented by deterministic weighted automata arises in several machine learning problems. We show that this problem can be naturally formulated as a shortest-distance problem over an intersection automaton defined on an appropriate semiring. We describe simple and efficient novel algorithms for its computati...

Implicitly Intersecting Weighted Automata using Dual Decomposition

We propose an algorithm to find the best path through an intersection of arbitrarily many weighted automata, without actually performing the intersection. The algorithm is based on dual decomposition: the automata attempt to agree on a string by communicating about features of the string. We demonstrate the algorithm on the Steiner consensus string problem, both on synthetic data and on consens...

Publication date: 2005